Search CORE

1,087 research outputs found

Bethe Projections for Non-Local Inference

Author: Belanger David
McCallum Andrew
Sheldon Daniel
Vilnis Luke
Publication venue
Publication date: 28/11/2016
Field of study

Many inference problems in structured prediction are naturally solved by augmenting a tractable dependency structure with complex, non-local auxiliary objectives. This includes the mean field family of variational inference algorithms, soft- or hard-constrained inference using Lagrangian relaxation or linear programming, collective graphical models, and forms of semi-supervised learning such as posterior regularization. We present a method to discriminatively learn broad families of inference objectives, capturing powerful non-local statistics of the latent variables, while maintaining tractable and provably fast inference using non-Euclidean projected gradient descent with a distance-generating function given by the Bethe entropy. We demonstrate the performance and flexibility of our method by (1) extracting structured citations from research papers by learning soft global constraints, (2) achieving state-of-the-art results on a widely-used handwriting recognition task using a novel learned non-convex inference procedure, and (3) providing a fast and highly scalable algorithm for the challenging problem of inference in a collective graphical model applied to bird migration.Comment: minor bug fix to appendix. appeared in UAI 201

arXiv.org e-Print Archive

CiteSeerX

Conservation theorems for the Cohesiveness Principle

Author: Belanger David R.
Publication venue
Publication date: 25/12/2022
Field of study

We prove that the Cohesiveness Principle (COH) is

\Pi^1_1

conservative over

RCA_0 + I\Sigma^0_n

and over

RCA_0 + B\Sigma^0_n

for all

n \geq 2

by recursion-theoretic means. We first characterize COH over

RCA_0 + B\Sigma^0_2

as a `jumped' version of Weak K\"{o}nig's Lemma (WKL) and develop suitable machinery including a version of the Friedberg jump-inversion theorem. The main theorem is obtained when we combine these with known results about WKL. In an appendix we give a proof of the

\Pi^1_1

conservativity of WKL over

RCA_0

by way of the Superlow Basis Theorem and a new proof of a recent jump-inversion theorem of Towsner

arXiv.org e-Print Archive

Synthesizing Normalized Faces from Facial Identity Features

Author: Belanger David
Cole Forrester
Freeman William T.
Krishnan Dilip
Mosseri Inbar
Sarna Aaron
Publication venue
Publication date: 17/10/2017
Field of study

We present a method for synthesizing a frontal, neutral-expression image of a person's face given an input face photograph. This is achieved by learning to generate facial landmarks and textures from features extracted from a facial-recognition network. Unlike previous approaches, our encoding feature vector is largely invariant to lighting, pose, and facial expression. Exploiting this invariance, we train our decoder network using only frontal, neutral-expression photographs. Since these photographs are well aligned, we can decompose them into a sparse set of landmark points and aligned texture maps. The decoder then predicts landmarks and textures independently and combines them using a differentiable image warping operation. The resulting images can be used for a number of applications, such as analyzing facial attributes, exposure and white balance adjustment, or creating a 3-D avatar

arXiv.org e-Print Archive

Crossref